Supporting User-Defined Functions on Uncertain Data
نویسندگان
چکیده
Uncertain data management has become crucial in many sensing and scientific applications. As user-defined functions (UDFs) become widely used in these applications, an important task is to capture result uncertainty for queries that evaluate UDFs on uncertain data. In this work, we provide a general framework for supporting UDFs on uncertain data. Specifically, we propose a learning approach based on Gaussian processes (GPs) to compute approximate output distributions of a UDF when evaluated on uncertain input, with guaranteed error bounds. We also devise an online algorithm to compute such output distributions, which employs a suite of optimizations to improve accuracy and performance. Our evaluation using both real-world and synthetic functions shows that our proposed GP approach can outperform the state-of-the-art sampling approach with up to two orders of magnitude improvement for a variety of UDFs.
منابع مشابه
Managing Continuous Uncertain Data by a Probabilistic XML Database Management System
Database systems are widely used in today’s world. Almost every information system contains one or more databases. From a traditional perspective, databases are used to store precise values about objects in the ’real world’. However, many information is uncertain or imprecise. Consider, for example, sensor applications. Sensors produce uncertain and imprecise data since readings of sensors are ...
متن کاملMethod of Collaborative Filtering Based on Uncertain User Interests Cluster
Recommender systems have been proven to be valuable means for Web online users to cope with the information overload and have become one of the most powerful and popular tools in electronic commerce. The suggestions provided are aimed at supporting their users in various decision-making processes, such as what items to buy, what music to listen, or what news to read. In the paper, we introduce ...
متن کاملارزشگذاری ویژگیهای موجودیتهای الگوی مفهومی اف. آر. بی. آر. از دیدگاه کاربران فهرستهای رایانهای
Purpose: The aim is investigating views of three groups of library users (non-professionals, specialized professionals, librarians) regardingthe importance and value of Attributes of entities of FRBR Conceptual model in supporting user tasks. Methodology: all attributes of entities of FRBR Conceptual model in supporting user tasks, was examined and evaluated through a descriptive-survey of th...
متن کاملAccess and Mobility Policy Control at the Network Edge
The fifth generation (5G) system architecture is defined as service-based and the core network functions are described as sets of services accessible through application programming interfaces (API). One of the components of 5G is Multi-access Edge Computing (MEC) which provides the open access to radio network functions through API. Using the mobile edge API third party analytics applications ...
متن کاملORION: Managing Uncertain (Sensor) Data
An important quality of sensor data is that it is often uncertain or imprecise. This uncertainty can be an inherent aspect of the data (e.g. due to known errors in the measuring device, such as the Gaussian error in GPS readings), or it may be introduced in order to achieve scalability [2, 1], or to ensure a certain level of privacy [4]. Existing database management systems provide virtually no...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- PVLDB
دوره 6 شماره
صفحات -
تاریخ انتشار 2013